NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

FuncPipe: A Pipelined Serverless Framework for Fast and Cost-Efficient Training of Deep Learning Models

https://doi.org/10.1145/3570607

Liu, Yunzhuo; Jiang, Bo; Guo, Tian; Huang, Zimeng; Ma, Wenhao; Wang, Xinbing; Zhou, Chenghu (December 2022, Proceedings of the ACM on Measurement and Analysis of Computing Systems)

Training deep learning (DL) models in the cloud has become a norm. With the emergence of serverless computing and its benefits of true pay-as-you-go pricing and scalability, systems researchers have recently started to provide support for serverless-based training. However, the ability to train DL models on serverless platforms is hindered by the resource limitations of today's serverless infrastructure and DL models' explosive requirement for memory and bandwidth. This paper describes FuncPipe, a novel pipelined training framework specifically designed for serverless platforms that enable fast and low-cost training of DL models. FuncPipe is designed with the key insight that model partitioning can be leveraged to bridge both memory and bandwidth gaps between the capacity of serverless functions and the requirement of DL training. Conceptually simple, we have to answer several design questions, including how to partition the model, configure each serverless function, and exploit each function's uplink/downlink bandwidth. In particular, we tailor a micro-batch scheduling policy for the serverless environment, which serves as the basis for the subsequent optimization. Our Mixed-Integer Quadratic Programming formulation automatically and simultaneously configures serverless resources and partitions models to fit within the resource constraints. Lastly, we improve the bandwidth efficiency of storage-based synchronization with a novel pipelined scatter-reduce algorithm. We implement FuncPipe on two popular cloud serverless platforms and show that it achieves 7%-77% cost savings and 1.3X-2.2X speedup compared to state-of-the-art serverless-based frameworks.
more » « less
Full Text Available
Joint Charging and Relocation Recommendation for E-Taxi Drivers via Multi-Agent Mean Field Hierarchical Reinforcement Learning

https://doi.org/10.1109/TMC.2020.3022173

Wang, Enshu; Ding, Rong; Yang, Zhaoxing; Jin, Haiming; Miao, Chenglin; Su, Lu; Zhang, Fan; Qiao, Chunming; Wang, Xinbing (April 2022, IEEE Transactions on Mobile Computing)

Full Text Available
Soil Moisture Sensing with UAV-Mounted IR-UWB Radar and Deep Learning

https://doi.org/10.1145/3580867

Ding, Rong; Jin, Haiming; Xiang, Dong; Wang, Xiaocheng; Zhang, Yongkui; Shen, Dingman; Su, Lu; Hao, Wentian; Tao, Mingyuan; Wang, Xinbing; et al (March 2022, Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies)

Wide-area soil moisture sensing is a key element for smart irrigation systems. However, existing soil moisture sensing methods usually fail to achieve both satisfactory mobility and high moisture estimation accuracy. In this paper, we present the design and implementation of a novel soil moisture sensing system, named as SoilId, that combines a UAV and a COTS IR-UWB radar for wide-area soil moisture sensing without the need of burying any battery-powered in-ground device. Specifically, we design a series of novel methods to help SoilId extract soil moisture related features from the received radar signals, and automatically detect and discard the data contaminated by the UAV's uncontrollable motion and the multipath interference. Furthermore, we leverage the powerful representation ability of deep neural networks and carefully design a neural network model to accurately map the extracted radar signal features to soil moisture estimations. We have extensively evaluated SoilId against a variety of real-world factors, including the UAV's uncontrollable motion, the multipath interference, soil surface coverages, and many others. Specifically, the experimental results carried out by our UAV-based system validate that SoilId can push the accuracy limits of RF-based soil moisture sensing techniques to a 50% quantile MAE of 0.23%.
more » « less
Full Text Available
Grad: Learning for Overhead-aware Adaptive Video Streaming with Scalable Video Coding

https://doi.org/10.1145/3394171.3413512

Liu, Yunzhuo; Jiang, Bo; Guo, Tian; Sitaraman, Ramesh K.; Towsley, Don; Wang, Xinbing (October 2020, ACM Multimedia Conference (MM'20))
null (Ed.)
Video streaming commonly uses Dynamic Adaptive Streaming over HTTP (DASH) to deliver good Quality of Experience (QoE) to users. Videos used in DASH are predominantly encoded by single-layered video coding such as H.264/AVC. In comparison, multi-layered video coding such as H.264/SVC provides more flexibility for up- grading the quality of buffered video segments and has the potential to further improve QoE. However, there are two challenges for us- ing SVC in DASH: (i) the complexity in designing ABR algorithms; and (ii) the negative impact of SVC’s coding overhead. In this work, we propose a deep reinforcement learning method called Grad for designing ABR algorithms that take advantage of the quality up- grade mechanism of SVC. Additionally, we quantify the impact of coding overhead on the achievable QoE of SVC in DASH, and propose jump-enabled hybrid coding (HYBJ) to mitigate the impact. Through emulation, we demonstrate that Grad-HYBJ, an ABR algo- rithm for HYBJ learned by Grad, outperforms the best performing state-of-the-art ABR algorithm by 17% in QoE.
more » « less
Full Text Available
SERENADE: A Parallel Iterative Algorithm for Crossbar Scheduling in Input-Queued Switches

https://doi.org/10.1109/HPSR48589.2020.9098995

Gong, Long; Liu, Liang; Yang, Sen; Xu, Jun Jim; Xie, Yi; Wang, Xinbing (May 2020, 2020 IEEE 21st International Conference on High Performance Switching and Routing (HPSR))

In an input-queued switch, a crossbar schedule, or a matching between the input ports and the output ports needs to be computed for each switching cycle, or time slot. It is a challenging research problem to design switching algorithms that produce high-quality matchings yet have a very low computational complexity when the switch has a large number of ports. Indeed, there appears to be a fundamental tradeoff between the computational complexity of the switching algorithm and the quality of the computed matchings. Parallel maximal matching algorithms (adapted for switching) appear to be a sweet tradeoff point in this regard. On one hand, they provide the following performance guarantees: Using maxi- mal matchings as crossbar schedules results in at least 50% switch throughput and order-optimal (i.e., independent of the switch size 𝑁 ) average delay bounds for various traffic arrival processes. On the other hand, their computational complexities can be as low as 𝑂 (log2 𝑁 ) per port/processor, which is much lower than those of the algorithms for finding matchings of higher qualities such as maximum weighted matching. In this work, we propose QPS-r, a parallel iterative switching algorithm that has the lowest possible computational complexity: 𝑂(1) per port. Yet, the matchings that QPS-r computes have the same quality as maximal matchings in the following sense: Using such matchings as crossbar schedules results in exactly the same aforementioned provable throughput and delay guarantees as using maximal matchings, as we show using Lyapunov stability analysis. Although QPS-r builds upon an existing add-on technique called Queue-Proportional Sampling (QPS), we are the first to discover and prove this nice property of such matchings. We also demon- strate that QPS-3 (running 3 iterations) has comparable empirical throughput and delay performances as iSLIP (running log 𝑁 itera- 2 tions), a refined and optimized representative maximal matching algorithm adapted for switching.
more » « less
Full Text Available
Data-Driven Pricing for Sensing Effort Elicitation in Mobile Crowd Sensing Systems

https://doi.org/10.1109/TNET.2019.2938453

Jin, Haiming; He, Baoxiang; Su, Lu; Nahrstedt, Klara; Wang, Xinbing (December 2019, IEEE/ACM Transactions on Networking)

Full Text Available
Dynamic Task Pricing in Multi-Requester Mobile Crowd Sensing with Markov Correlated Equilibrium

https://doi.org/10.1109/INFOCOM.2019.8737506

Jin, Haiming; Guo, Hongpeng; Su, Lu; Nahrstedt, Klara; Wang, Xinbing (April 2019, IEEE INFOCOM 2019 - IEEE Conference on Computer Communications)

Full Text Available

Search for: All records